iXUPT: Indexing XML Using Path Templates
نویسندگان
چکیده
The XML format has become the standard for data exchange because it is self-describing and it stores not only information but also the relationships between data. Therefore it is used in very different areas. To find the right information in an XML file, we need to have a fast and an effective access to data. Similar to relational databases, we can create an index in order to speed up the querying for the information. There are several ways of indexing XML data but previous research showed that one of the most effective approaches is to index root-to-leaf paths in the input file. So we took the inspiration from existing path-based indexing concepts, enhanced those ideas, and created a new native XML indexing method derived from the combination of existing approaches in order to improve the evaluation time of regular path expressions in XPath queries.
منابع مشابه
Okapi-based XML indexing
Purpose – Being an important data exchange and information storage standard, XML has generated a great deal of interest and particular attention has been paid to the issue of XML indexing. Clear use cases for structured search in XML have been established. However, most of the research in the area is either based on relational database systems or specialized semi-structured data management syst...
متن کاملCtree: A Compact Two-level Bidirectional Tree for Indexing XML Data
Indexing XML data to facilitate query processing has been a popular subject of study in recent years. Most of previous studies can be classified into three categories: path indexing, node indexing and sequence-based indexing. Many of them cannot answer both single-path and branching queries with various value predicates very efficiently. In this paper, we propose a novel compact tree (Ctree) st...
متن کاملImplementation and Evaluation of a Path Indexing Framework for Large Collections of Interlinked XML
In the Semantic Web, the extension of the current Web, XML documents are used to provide information together with a well–definded structure to support effective data discovery and automation. Some years ago, they have mostly been used for exchanging data between different applications, hence the complete information was contained in a single document. Nowadays, as XML is more and more used as ...
متن کاملFliX: A Flexible Framework for Indexing Complex XML Document Collections
While there are many proposals for path indexes on XML documents, none of them is perfectly suited for indexing large-scale collections of interlinked XML documents. Existing strategies lack support for intraor inter-document links, require large amounts of time to build or space to store the index, or cannot efficiently answer connection queries. This paper presents the FliX framework for conn...
متن کاملIndexing XML Data with UB-trees
Using the terminology usual in databases, it is possible to view XML as a language for data modelling. To retrieve XML data from XML databases, several query languages have been proposed. The common feature of these languages is the use of regular path expressions. Users are allowed to navigate through arbitrary long paths in the data by regular path expressions. Several index structures for XM...
متن کامل